SSML Extensions Aimed To Improve Asian Language TTS Rendering
نویسندگان
چکیده
Both formant synthesis based and concatenative acoustic unit based TTS systems have been developled in Nokia. Many non-English languages have been considered in the development work, and Nokia's Mandarin Chinese TTS system is under continuous development within the TC-STAR framework (www.tc-star.org). To meet the needs of the TTS evaluations in TC-STAR, common interfaces for the input and all the internal modules have been carefully defined. SSML has been taken into use as the input format, and Nokia has proposed extensions related to the Asian language peculiarities.
منابع مشابه
Implementing an SSML compliant concatenative TTS system
The W3C Speech Synthesis Markup Language (SSML) unifies a number of recent related markup languages that have emerged to fill the perceived need for increased, and standardized, user control over Text to Speech (TTS) engines. One of the main drivers for markup has been the increasing use of TTS engines as embedded components of specific applications – which means they are in a position to take ...
متن کاملMultilayered extensions to the speech synthesis markup language for describing expressiveness
In this paper we discuss possible extensions to the Speech Synthesis Markup Language (SSML) to facilitate the generation of synthetic expressive speech. The proposed extensions are hierarchical in nature, allowing specification in terms of physical parameters such as instantaneous pitch, higher-level parameters such as ToBI labels, or abstract concepts such as emotions. Low-level tags tend to c...
متن کاملA Corpus-based Approach to <ahem/> Expressive Speech Synthesis
Human speech communication can be thought of as comprising two channels – the words themselves, and the style in which they are spoken. Each of these channels carries information. Today's most-advanced text-to-speech (TTS) systems such as [1],[2],[3],[4] fall far short of human speech because they offer only a single, fixed style of delivery, independent of the message. In this paper, we descri...
متن کاملSSML Goes International – A Standard Story
Since September 2004, the SSML 1.0 [1] specification has been a W3C Recommendation. SSML is the standard way that a Voice Browser controls speech synthesis engine. Given that it is a standard, actions to define the language of the text to be rendered, to change between several voices, to insert pauses, to perform simple text normalization (e.g. acronym expansions, such as reading W3C as “World ...
متن کاملTowards Synthesis of Focus in Mandarin Text-to-speech System
This paper introduces the significance of synthesis of focus in Mandarin text-to-speech (TTS) system, as well as the key challenges in research on synthesis of focus. The proposal on the extension of Speech Synthesis Markup Language (SSML) is presented for the improvement of intelligibility of key words or phrases, and also demonstrated by an example finally.
متن کامل